Learning Situation-Dependent Costs: Using Execution to Re ne Planning Models
نویسندگان
چکیده
Physical environments are so complex that it is hard to hand-tune all of the domain knowledge, especially to model the dynamics of the environment. The work presented in this paper explores machine learning techniques to autonomously identify situations in the environment that a ect plan quality. We introduce the concept of situation-dependent costs, where situational features can be attached to the costs used by the path planner. These costs e ectively diagnose and predict situations the robot encounters so that the planner can generate paths that are appropriate for each situation. We present an implementation of our situationdependent learning approach in a real robotic system, Rogue. Rogue learns situation-dependent costs for arcs in a topological map of the environment; these costs are then used by the path planner to predict and avoid failures. In this article, we present the representation of the path planner and the navigationmodules, and describe the execution trace. We show how training data is extracted from the execution trace. We present experimental results from a simulated, controlled environment as well as from data collected from the actual robot. Our approach e ectively re nes models of dynamic systems and improves the efciency of generated plans.
منابع مشابه
Situation-Dependent Learning for Interleaved Planning and Robot Execution
This dissertation presents the complete integrated planning, executing and learning robotic agent Rogue. Physical domains are notoriously hard to model completely and correctly. Robotics researchers have developed learning algorithms to successfully tune operational parameters. Instead of improving low-level actuator control, our work focusses instead at the planning stages of the system. The t...
متن کاملLearning Situation-Dependent Rules: Improving Planning from Robot Execution
Real world robot tasks are so complex that it is hard to hand-tune all of the domain knowledge, especially to model the dynamics of the environment. In this paper we present techniques for learning from real-world robot execution to improve planning and execution performance. We present our work learning from execution to improve a task planner's performance. Our system collects execution trace...
متن کاملSituation ActionPlanner Situation ActionReactivePolicy A ) B ) C ) Situation
Dyna is an AI architecture that integrates learning, planning, and reactive execution. Learning methods are used in Dyna both for compiling planning results and for updating a model of the eeects of the agent's actions on the world. Planning is incre-mental and can use the probabilistic and ofttimes incorrect world models generated by learning processes. Execution is fully reactive in the sense...
متن کاملSimulation-Based Planning for Computer Generated Forces
Planning in AI has been an active research topic for more than thirty years but only recently has it started to move in the direction of combining planning and execution to achieve what is sometimes called as`In-telligent Reactive Planning'. We propose simulation-based planning as a new way to perform intelligent re-active planning. Simulation-based planning |unlike most other planning systems|...
متن کاملSoccer Goalkeeper Task Modeling and Analysis by Petri Nets
In a robotic soccer team, goalkeeper is an important challenging role, which has different characteristics from the other teammates. This paper proposes a new learning-based behavior model for a soccer goalkeeper robot by using Petri nets. The model focuses on modeling and analyzing, both qualitatively and quantitatively, for the goalkeeper role so that we have a model-based knowledge of the ta...
متن کامل